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Abstract 

Ito's construction of Markovian solutions to stochastic equations driven by a 
Levy noise is extended to nonlinear distribution dependent integrands aiming at 
the effective construction of linear and nonlinear Markov semigroups and the corre- 
sponding processes with a given pseudo-differential generator. It is shown that a con- 
ditionally positive integro-differential operator (of the Levy-Khintchine type) with 
variable coefficients (diffusion, drift and Levy measure) depending Lipschitz con- 
tinuously on its parameters (position and/or its distribution) generates a linear or 
nonlinear Markov semigroup, where the measures are metricized by the Wasserstein- 
Kantorovich metrics. This is a nontrivial but natural extension to general Markov 
processes of a long known fact for ordinary diffusions. 
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1 Introduction and formulation of main results 

By C(R n ) (respectively C tX) (R r1 )) we denote the Banach space of continuous bounded 
functions on R n (respectively its subspace of functions vanishing at infinity) with the 
sup-norm denoted by || • ||, and C fc (R n ) (resp. C*(R n )) denotes the Banach space of 
k times continuously differentiate functions with bounded derivatives on R n (resp. its 
subspace of functions with a compact support) with the norm being the sum of the sup- 
norms of a function and all its partial derivative up to and including the order k). 
For an / G C 1 (R n ) the gradient will be denoted by 

V/ = (V 1 /,...,V„/) = (^,...,^). 



* submitted to PTRF September 2008 

^Department of Statistics, University of Warwick, Coventry CV4 7AL UK, Email: 
v . kolokolt sov@ Warwick .ac.uk 



f 



For a measure v and a mapping F we denote by v F the push forward of v with respect 
to F defined as u F (A) = u(F~ 1 (A)). 

Further basic notations: 1 m is the indicator function of a set M, M. (R d ) is the set of 
finite positive Borel measures on R d , B r is the ball of radius r centered at the origin, and 
the pairing (/,//) for / G C(R d ), // G .M(R d ) denotes the usual integration. The bold 
letters E and P will denote expectation and probability. A positive number in the square 
bracket, say [x], will denote the integer part of it. By the small letter c we shall denote 
various constants indicating in brackets (when appropriate) the parameters on which they 
depend. 

It is well known (the Courrege theorem, see e.g. [7]) that the generator L of a 
conservative (i.e. preserving constants) Feller semigroup in R d is conditionally positive 
(/ > 0, f(x) = =>- Lf(x) > 0) and if its domain contains the space C^(R d ), then it 
has the following Levy-Khintchine form with variable coefficients: 

Lf{x) = ±(G(x)V,V)f(x)+(b(x),Vf(x))+ J (f(x+y)-f(x)-(Vf(x),y)l Bl (y))v(x,dy), 

(1) 

where G(x) is a symmetric non-negative matrix and v{x, .) a Borel measure on R d (called 
Levy measure) such that 

/ min(l,M>(x;d?/)<oo, v({0}) = 0. (2) 

The inverse question on whether a given operator of this form (or better to say its closure) 
actually generates a Feller semigroup is nontrivial and attracted lots of attention. One can 
distinguish analytic and probabilistic approaches to this problem. The existence results 
obtained by analytic techniques require certain non-degeneracy condition on v, e.g. a 
lower bound for the symbol of pseudo-differential operator L (see e.g. [2], [3], [7]- [12] and 
references therein), and for the construction of the processes via usual stochastic calculus 
one needs to have a family of transformations F x of R d preserving the origin, regularly 
depending on x and pushing a certain Levy measure v to the Levy measures u(x, .), i.e. 
i/(x, .) = u Fx (see e.g. pQ, [1], [12]). Of course yet more nontrivial is the problem of 
constructing the so called nonlinear Markov semigroups solving the weak equations of the 
form 

^(f,fH) = (L IH f, fH ), [i t G V(R d ), l i Q = n, (3) 

that should hold, say, for all / G C^(Tl d ), where has form (CQ), but with all coefficients 
additionally depending on /i, i.e. 

LJ(x) = i(G(x,/i)V, V)/(x) + (b(x,fi),Vf(x)) 

+ J(f(x + y)-f(x)-(Vf(x),y)l Bl (y))v(x,v,dy). (4) 

Equations of type ([3]) play indispensable role in the theory of interacting particles (mean 
field approximation) and exhaust all positivity preserving evolutions on measures subject 
to certain mild regularity assumptions (see e.g. [13], [19]). A resolving semigroup U t : /i i— > 
fi t of the Cauchy problem for equation ([3]) specified a so called generalized or nonlinear 
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Markov process X(t), whose distribution fi t at time t can be determined by the formula 
Ut^ s fi s from its distribution /x s at any previous moment s. 

In the case of diffusions (when v vanishes in (fill or (11])) the theory of the corresponding 
semigroups is well developed, see [T7] and more recent achievements in [Bj. Also well 
developed is the case of pure jump processes, see e.g. the treatment of the Boltzmann 
equation (spatially trivial) in [20J. 

The goal of the present paper is to exploit the idea of nonlinear integrators (see [5], 
|16j ) combined with a certain coupling of Levy processes in order to push forward the 
probabilistic construction in a way that allows the natural Lipschitz continuous depen- 
dence of the coefficients G, b, v on x, \i with measures equipped with their Wasserstein 
metric (see the definition below). Thus obtained extension of the standard SDEs with 
Levy noise represent a probabilistic counterpart of the celebrated extension of the Monge 
mass transformation problem to the generalized Kantorovich one. To streamline the ex- 
position we shall use Ito's approach (as exposed in detail in [19]) for constructing the 
solutions of stochastic equations directly via Euler approximation scheme bypassing the 
theory of stochastic integration itself. Roughly speaking the idea is to approximate a 
process with a given (formal) generator (or pre-generator) by processes with piecewise 
Levy paths. 

For a random variable X we shall denote by C(X) the distribution (probability law) 
of X. Recall that the so called Wasserstein-Kantorovich metrics W p , p > 1, on the set of 
probability measures P(R d ) on R d are defined as 

W p (u h u 2 ) = Anf J \y x -y2\ p v{dy 1 dy 2 ) S j , (5) 

where inf is taken over the class of probability measures v on R 2d that couple v\ and v 2 , 
i.e. that satisfy 




(Mvi) + 4>2{V2))y{dy\dy 2 ) = (0i, u x ) + (02, v 2 ) (6) 

for all bounded measurable 0i,02- It follows directly from the definition that 

W^,v')<E\\X-X'r (7) 

whenever \i = C(X) and // = C(X'). 

For random variable x, z we shall write sometimes shortly W p (x, z) for W p (C(x), C(z)) 
(with some obvious abuse of notation). 

It is well known (see e.g. [22]) that (V(R d ),W p ), p > 1 is a complete metric space 
and that the convergence in this metric space is equivalent to the convergence in the weak 
sense combined with the convergence of the pth moments. In case p = 1 the celebrated 
Monge-Kantorovich theorem states that 

W 1 (fi 1 ,n 2 )= sup |(/,/ii) - (f,H 2 )\, 
feiip 

where Lip is the set of continuous functions / such that \f(x) — f(y)\ < \\x — y\\ for all 
x,y. 
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We shall need also the Wasserstein distances between the distributions in the Skorohod 
space _D([0,T],R d ) of cadlag paths in R d defined of course as 

W P:T (X u X 2 )=mi(Esnp\X 1 (t)-X 2 (t)\^ '\ (8) 

where inf is taken over all couplings of the distributions of the random paths Xi,X 2 . 
Notice that this distance is linked with the uniform (and not Skorohod) topology on the 
path space. 

To compare the Levy measures, we shall need an extension of these distances to 
unbounded measures. Namely, let M. p (R d ) denote the class of Borel measures // on 
R d \{0} (not necessarily finite) with a finite p-th moment (i.e. such that J \y\ p [i(dy) < oo). 
For a pair of measures ^1,^2 from _M p (R d ) we define the distance ^(^1,^2) by ([5]), 
where inf is now taken over all v £ A4 p (Tl 2d ) such that holds for all (pi, $2 satisfying 
(f>i(.)/\.\ p G C(R d ). It is easy to see that for finite measures this definition coincides with 
the previous one and that if measures z/i and z/ 2 are infinite, the distance W p {v\ ) V2) is 
finite. 

Moreover, by the same argument as for finite measures (see [18] or [22]) one shows 
that whenever the distance W p (v\ ) V2) is finite, the infimum in ([5]) is achieved, i.e. there 



exists a measure v e _M P (R M ) such that 



Wp(//i,// 2 ) = (J \yi - y 2 \ p v{dyidy 2 )\ . (9) 
Theorem 1.1 Let an operator L have form ([T]) ; where 

\WG{x l )-^G{x 2 )\\^b{x 1 )-b{x 2 )\+W 2 {i B ^)v{x l -.),i B ^)v{x 2 -.)) < «||xi-x 2 || (10) 

with a certain constant k, and 

sup ( y/G(x) + \b(x)\ + / \y\ 2 v{x,dy) J < 00. (11) 

Let the family of finite measures {l R d\ Bl )(.)z/(s; .)} be uniformly bounded, tight and de- 
pend weakly continuous on x. Then L extends to the generator of a conservative Feller 
semigroup. 



Remarks. 1. The boundedness condition fflT]) is not essential and can be dispensed 
with by the usual localization arguments, see [15] . 2. Once the well posed-ness of the 
equations generated by L is obtained, it implies various extensions of the results on the 
corresponding boundary value problems, problems with unbounded coefficients, fractional 
dynamics or Malliavin calculus (see |21| . |12j . |14j . [1]) obtained earlier for particular cases. 

For example, assumption on v is satisfied if one can decompose the Levy measures 
v[x\ .) in the countable sums v{x\ .) = YlnLi v n{ x ] ■) of probability measures so that 
W2(vi(x; .), Vi{z\ .)) < cii\x — z\ and the series X^ a i converges. It is well known that 



1 Let a decreasing sequence of positive numbers be defined by the condition that v\ can be de- 
composed into the sum v\ = X^^Li v \ °f the probability measures ^™ having the support in the closed 
shells {x € R d : e„ < \x\ < e^_i} (where ej = 00). Similarly Q and v% are defined. Then the sum 
v = Y^=i v \ ® v 2 i s a coupling of v\ and v% with a finite / \y\ — y2\ p v{dyxdy-i). 
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the optimal coupling of probability measures (Kantorovich problem) can not always be 
realized via a mass transportation (a solution to the Monge problem), thus leading to the 
examples when the construction of the process via standard stochastic calculus would not 
work. On the other hand, no non- degeneracy is build in this example leading to serious 
difficulties when trying to apply analytic techniques in these circumstances. 

Another important particular situation is that of a common star shape of the measures 
u(x; .), i.e. if they can be represented as 

v(x;dy) = u(x,s,dr)u(ds), y G R d , r = \y\ G R+, s = y/r G S"* -1 , (12) 

with a certain measure oj on S and a family of measures u(x, s, dr) on R + . This allows 
to reduce the general coupling problem to a much more easily handled one-dimensional 
one, because evidently if v x ,y,s(.dridr 2 ) is a coupling of u(x,s,dr) and u(y,s,dr), then 
v x ,y,s(dr\dr2)uj(ds) is a coupling of v{x\ .) and v(y; .). If one-dimensional measures have 
no atoms, their coupling can be naturally organized via pushing along a certain mapping. 
Namely, the measure v F is the pushing forward of a measure v on R + by a mapping 
F : R + I— > R + whenever 

r f(F(r))u(dr) = J f(u)v F (du) 

for a sufficiently rich class of test functions /, say for the indicators of intervals. Suppose 
we are looking for a family of monotone continuous bijections F XjS : R + i— > R + such that 
v Fx,s _ i/(x,s,.). Choosing / = 1[f( z ),oc) as a test function in the above definition of 
pushing yields 

G(x,s,F XfS (z)) = v([z,oo)) (13) 

for G(x,s,z) = u(x, s, [z, oo)) = J u(x, s,dy). Clearly if all u(x,s,.) and v are un- 
bounded, but bounded on any interval separated from the origin, have no atoms and do 
not vanish on any open interval, then this equation defines a unique continuous monotone 
bijection F x s : R + > R + with also continuous inverse. Hence we arrive to the following 
criterion. 



Proposition 1.1 Suppose the Levy measures u(x; .) can be represented in the form ( fl2l) 
and v is a Levy measure on R + such that all v{x,s,.) and v are unbounded, have no 
atoms and do not vanish on any open interval. Then the family u(x; .) depends Lipshitz 
continuous on x in W2 whenever the unique continuous solution F XyS (z) to (TlBl is Lipschitz 
continuous in x with a constant kf(z,s) enjoying the condition 



Kp(r, s)u(ds)i/(dr) < 00. (14) 

R+ J 3 d - 1 

Proof. By the above discussion the solution F specifies the coupling v x ^ y (dridr2dsids2) of 
u(x; .) and u(y; .) via 

j f(r 1 ,r 2 ,s 1 ,s 2 )v x ,y(dr 1 dr2ds 1 ds2) = J f(F x>s (r), F y>s (r), s, s)u(ds)i/(dr), 

so that for Lipschitz continuity of the family v(x\ .) it is sufficient to have 

I [ {F XtS - F y , s fu{ds)v{dr) < c{x - y)\ 
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which is clearly satisfied whenever (|T4|) holds. 

The particular case of is(x, s, .) above having densities with respect to the Lebesgue 
measure on R + is discussed in much detail in [T9] . 

The point to make here is that a coupling for the sum of Levy measures can be 
organized separately for each term allowing to use the above statement for star shape 
components and, say some discrete methods for discrete parts. 

Theorem II .11 is a straightforward corollary of our main theorem that we shall formulate 
now. To make our exposition more transparent we shall present the main arguments in 
the case of having the form 

L„f(x) = i(G(x,/i)V, V)f(x) + (b(x,fj,), V/(x))+ J {f{x+z)-f{x)-{Vf(x),z))v(x,r,dz) 

(15) 

with u(x, [A] .) G A4 2 (R d ). Let Y T (z, /x) be a family of Levy processes depending measur- 
ably on the points z and probability measures /i in R d and specified by their generators 

L[z, v\f{x) = ±(G(z, /i) V, V)/(x) + {b(z, /i), V/(*)) 

+ J (f(x + y)~ f(x) - (V/(z), y))v(z, fx; dy) (16) 

where u(z, li) G M.2(R d ). Under the conditions of Theorem 11.21 given below, the existence 
of such a family follows from the well known randomization lemma H (see e.g. [8], Lemma 
3.22), because by Proposition IA. II the mapping from z, \x to the law of the Levy process 
Y T (z, (i) is continuous, hence measurable, and consequently, by this Lemma (with Z being 
the complete metric space D(R + , R d ), and hence a Borel space) one can define all Y T (z, li) 
on the single standard probability space [0, 1]. Let us stress for clarity that the processes 
Y T (x,fi) depend on x,fi only via the parameters of the generator, i.e., say, the random 
variable £ = x + Y T (x, C(x)) has the characteristic function 

Our approach to solving ([3]) is via the solution to the following nonlinear distribution 
dependent stochastic equation with nonlinear Levy type integrators: 

X(t)=X+ [ dY s (X(s),jC(X(s))), C(X) = n, (17) 

with a given initial distribution \x and a random variable X independent of Y T (z,fi). 

We shall define the solution through the Euler type approximation scheme, i.e. by 
means of the approximations X^: 

x;(t) = x;(h) + Y t i lT (x;(h), c(x;(ir))), c(x;(o)) = M , (is) 

where It < t < (I + l)r, I = 0,1,2,..., and Yj(x,fi) is a collection (depending on I) 
of independent families of the Levy processes Y T (x,fi) introduced above. Clearly these 
approximation processes are cadlag. 

2 It states that if /x(x, dz) is a probability kernel from a measurable space X to a Borel space Z, then 
there exists a measurable function / : X x [0, 1] — > Z such that if is uniformly distributed on [0, 1], then 
f(X,8) has distribution fi(x, .) for every x € X. 
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For x G R d we shall write shortly X^(kr) for X^(kr). 

By the weak solution to (TTTj) we shall mean the weak limit of XJf, r k = 2~ k , k — > oo, 
in the sense of the distributions on the Skorohod space of cadlag paths (which is of 
course implied by the convergence of the distributions in the sense of the distance flS])). 
Alternatively one could define it as a solution to the corresponding nonlinear martingale 
problem (see below the proof of the main theorem) or directly via the construction of the 
corresponding stochastic integral. This issue is addressed in detail in [15], our purpose 
here being the construction of a Markov process with a given generator. 

The following is our main result. 

Theorem 1.2 Let an operator L. have form ffTBI) . Moreover 

\\y/G(x, n)-y/G(z,rj)\\ + \b(x, fi)-b(z,r})\+W 2 (v(x, \i\ .),v(z,r)i .)) < k(\x-z\+W 2 (ji, rj)), 

(19) 

holds true with a constant k and 

sup I a/ G(x, Jt) + \b(x, fi) \ + / \y\ 2 i , (x, fi,dy) J < oo. (20) 

Then 

(i) for any \i G V(R d ) nA4 2 (R d ) the approximations XJf converge to a process X^(t) 
in the sense that 

sup sup Wi{X T *{[tlT k \T k ,X^t)) <c(t Q )r k (21) 
a* te[o,t ] 

for any t , and even stronger 

supW 2 2 it0 (X^,Xj <c(t )r k] (22) 

A* 

(ii) the distributions fi t = C(X^{t)) depend 1/2-Hdlder continuous on t in the metric 
W 2 and X^{t) depend Lipschitz continuously on the initial condition in the following sense: 

sup W^X,(t),X v (t)) < c{t Q )W^ri)- (23) 
te[o,to] 

(Hi) the processes 

M(t) = fiX.it)) - fiX°) - fiL^x^fiX.is)) ds (24) 



are martingales for any f G C 2 (R d ); in other words, the process X.it) solves the corre- 
sponding (nonlinear) martingale problem; 

(iv) the distributions fi t = £(X /J (t)) satisfy the weak nonlinear equation (j3J) (that holds 
for all f EC 2 iR d )); 

(v) the resolving operators U t : \x i— ► fit of the Cauchy problem form a nonlinear 
Markov semigroup, i.e. they are continuous mappings from V(R. d ) H Ai 2 (R. d ) (equipped 
with the metric W 2 ) to itself such that Uq is the identity mapping and U t+S = U t U s for all 
s, t > 0. If L[z,fj] do not depend explicitly on \i the operators T t /(x) = Ef(X x (t)) form 
a conservative Feller semigroup preserving the space of Lipschitz continuous functions. 
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This theorem is proved in the next section. In Sections 3 we obtain some regularity 
criteria for the Markov semigroups constructed. 

A simple meaningful example is given by the nonlinear kinetic equations 

j t {f, fi t ) = {Lf, Mt ) + J (K{x, y), Vf{x))iH{dx)iH{dy), (25) 

with L being of form ([1]) with Lipschitz continuous coefficients and K being a bounded 
Lipschitz continuous mapping R 2d \— > R d , which arise as the mean-field limit for potentially 
interacting Feller processes. 

Theorem 11.11 follows now from Theorem 11.21 by the standard perturbation theory, 
since dividing the generator into two parts, where the first part is the integral term with 
the Levy measure reduced to H d \ B\, one gets a sum of two generators, one of which 
is bounded in C o(R' i ) (as follows from the assumed tightness) and the other satisfies 
Theorem 11.21 

It is worth noting that in a simpler case of generators of up to the first order the 
continuity of Levy measures with respect to a more easy handled metric W\ is sufficient, 
as shows the following result, whose proof is omitted (as being a simplified version of the 
proof of Theorem 11.21) . 

Theorem 1.3 Let an operator L^ have the form 
L,f(x) = {b(x, fi, V/(z)) + j (f(x + z)- f(x)Hx, k dz), v{x, /i; .) G A^(R d ). (26) 
and 

\\b{x,fi)-b{z,r ] )\\+W 1 {p{x,fi;.),u{z,r r ,-)) < k{\\x - z\\ + Wxiji, rj)) (27) 

holds true with a constant k. Then for any \x G V(R d ) n Mi(R d ) there exists a process 
Xfj,(t) solving ( TT7|) (with analogously defined Y T (z, fi)) such that 

sup w 1M (x;\x,) < c(t )n, (28) 

the distributions fi t = C(X(t)) depend 1/2 -Holder continuous on t in the metric W\ and 
X^{t) depend Lipschitz continuously on the initial condition in the following sense: 

W 1 {X lx {t),X 11 {t)) < c{t )W 1 (^,r 1 ). (29) 

Moreover, the processes (I24p are martingales for any f G C 1 (R d ) and the distributions 
fit = C(X^(t)) satisfy the weak nonlinear equation ([3]) (that holds for all f G C 1 (R d ) ). If 
L[z, fi] do not depend explicitly on \i the operators T t f(x) = Ef(X x (t)) form a conservative 
Feller semigroup. 

In Appendix we describe a coupling of Levy processes that is crucial for our purposes. 

2 Proof of Theorem O 

Step 1 (uniform continuity of the approximations with respect to initial data). 
One has 

W 2 2 (x x + Y s (x u C(x 1 )),x 2 + Y s (x 2 , C(x 2 ))) < E(^i - 6) 2 
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for any random variable (£1,62) with the projections = x, + K s (xj,/ii), /ij = C(xi), 
i = 1,2. Let us choose the coupling described by the characteristic function 

Ee i(pifc-H»&) = /" e ipi(:ci+s/l)+tp2(:E2+y2 V(^irfx 2 )P x s li:!;2imM2 (rfi/irfi/ 2 ), 

where /x is an arbitrary coupling of the random variables Xi,x 2 and P s is the coupling of 
the Levy processes Y s (xi, /jj) given by Proposition IA.1I Consequently, 

E(6-6) 2 = -^l P =o Ee*<*-&> 
[Oi + Vx) ~ O2 + y2)} 2 fi{dx l dx 2 )P^ uX2ifllifl2 {dyidy 2 ), 

R 4d 

which by fj46|) does not exceed 

((xi - x 2 ) 2 + cs[(xi - x 2 ) 2 + W 2 2 (£(xi), £(x 2 ))]) n(dx 1 dx 2 ). 



(30) 



/ R 2d 

Consequently, by (JTj), 

E(6~6) 2 < / (l + 2cs)(xi -x 2 fii{dx 1 dx 2 ). 

J-R? d 

Hence, taking infimum over all couplings, yields 

Wf (xx + y a (x 1? £(x x )), x 2 + Y s (x 2 , C{x 2 ))) < (1 + 2c S )W 2 2 (£(x 1 ), C{x 2 )). (31) 
Applying this inequality inductively, yields 

w 2 2 (x;(s),x;(s)) < e^w^, v ) (32) 

with a constant c uniformly for all r < 1, s > 0, /x, r\ G P(R a! ) fl .M 2 (R d ). 
Step 2 (subdivision and the existence of the limit). 
We want to estimate the W2 distance between the random variables 

fl = X + K r (x, /i) = x' + ^(X, /i), £ 2 = / + V)> 

where the families K s and Y,' are independent, 

a/ = x + Y r/2 (x,fj l ), z' = z + Y r/2 (z,r]), 

and /i = £(x), 77 = £(2), 77' = £(2'). We shall couple £1 and £ 2 using sequentially 
Proposition IA.11 Namely, we shall define it by the equation 

E/(fi, 6) = / /(a? + vi + 1/1, z + v 2 + y 2 )[i[dxdz)P T Jl (dv^dv 2 )Pl^, ,{dy x dy 2 ) 

for / G C(R 2d ), where, say, P T J 2 is the coupling of the Levy processes YL 2 (x,/i) and 
Y^ 2 (z',r]') with = 2; + t> 2 given by Proposition IA.1I (note that the probability law 7/ is 
the function of z,r]). 
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Now by (|46]) 

wf(6,6)<E(£i-6) 2 

< E(x' - z') 2 + cr[E(x' - z'f + E(x - z f + W/f(/i, 77')]. 
Hence, by flU and © W|(fi>6) does not exceed 

W 2 2 (x, z)(l + 2cr)(l + cr) + 2crE(x - ^') 2 

and consequently also 

W 2 (x, z){l + 2cr) + 4ctE(Y t/2 (z, r,)) 2 

(with another constant c) so that 

W*(tut2)< W*{x,z){l + cr) + cr 2 

(with yet another c), because the second moments of our processes Y T are bounded due 
to assumption (1201) . Consequently 

W|(X;(fcr),X;/ 2 (fcr)) < cr 2 + (1 + cr)W 2 (X;((k - l)r),X;/ 2 ((fc - l)r)). (33) 

By induction one estimates the l.h.s. of this inequality by 

r 2 [l + (1 + cr) + (1 + cr) 2 + ... + (1 + cr)^] < c~ l r(l + cr) k < c(t )r. 

Repeating this subdivision and using the triangle inequality for distances yields 

W 2 2 (X;(kr),X;/ 2m (kr))<c(t )T. 

This implies the existence of the limit X x k ([t/Tk\rk), as k — > 00, in the sense of fl2T]) . 

Observe now that fl32|) implies fl23|) . Moreover, the mapping T t f(x) = Ef(X x (t)) 
preserves the set of Lipschitz continuous functions. In fact, if / is Lipschitz with the 
constant h, then 

m(K(Wr]r)) - Ef(X:([t/r]r))\ < hE\\X;([t/r]r) - (XJ([t/r]r) || 

<h{E\\Xl{[tlr]r)-{Xl{[t/r]r)\\ 2 ) 1/2 . 
for any coupling of the processes X T X and X T Z . Hence by (132|) 

\Ef(X:([t/r]r)) - Ef(Xl([t/r}r))\ < hc(t )W 2 (x , z) . 

In particular, T t preserves constant functions. Similarly one shows (first for Lipschitz 
continuous / and then for all / G Coo(R. d ) via standard approximation) that 

sup sup \Ef(X?{[t/r k )T k )) - Ef(X x (t))\ - 0, k - 00, (34) 

tg[0,t ] x 

for all / G Coo(R d ). Moreover, as the dynamics of averages of the approximation pro- 
cesses clearly preserve the space C (X3 (R ), the same holds for the limiting mappings T t .. 
Consequently T t f = Ef(X x (t)) is a positivity preserving family of contractions in C(R d ) 
that preserve constants and the space C 00 (R d ). Hence the mappings U% : // 1— > m form a 



10 



(nonlinear) Markov semigroup, and if L[z,p] do not depend explicitly on \i y the operators 
T t f(x) = Fjf(X x (t)) form a conservative Feller semigroup. The Markov (or semigroup) 
property of the solutions follows from the construction (a detailed discussion of this fact 
in a similar situation is given in [T5]). 
From the inequality 

W^C(X;(It)),C(X;((1 - 1) T ))) < E [Y^\X;((1 - l)r),C(X;((l - l)r)))] 2 < cr 

it follows that the curve \i t depends l/2-H61der continuously on t in W2. 
Step 3 (improving convergence and solving the martingale problem) 
The processes 

M T (t) = f(X;(t)) - f(X) - f L[X;([s/T}r)^l s/T] }f(X;(s))ds, n = C(X), (35) 

Jo 

where \i\ = £(X^(/r)), are martingales by Dynkin's formula, applied to Levy processes 
Y T (z, p). Our aim is to pass to the limit — > to obtain the martingale characterization 
of the limiting process. But let us first strengthen our convergence result. 

Observe that the step by step inductive coupling of the trajectories X^ and X'L used 
above to prove (1321) actually defines the coupling between the distributions of these random 
trajectories in the Skorohod space D([0, to], R d ) for any to, i.e. a random trajectory 
(XZ^X'L) in D([0, to], R 2d ). One can construct the Dynkin martingales for this coupled 
process in the same way as above for X^. Namely, for a function / of two variables with 
bounded second derivatives the process 

M T (t) = f(x;(t),x;(t))- [ Lj(x;(s),x;( s ))ds, » = c{xj, v = c(x v ), 

Jo 

is a martingale, where L t is the coupling operator fj43l) constructed from the Levy processes 
Y with parameters X^([t/r]r), fij t/r] and X T q ([t / t}t) , rjj t/r] . 

Choosing f(x, y) = (x — y) 2 leads to the martingale of the form 

(x;(t) - x:(t)f + fo(\)(x;{s) -xi{s)fds 

Jo 

(the estimate for the integrand follows from fj45|) and the assumed Lipschitz continuity of 
the coefficients of L). Applying the martingale property in conjunction with Gronwall's 
lemma yields 

supE(X;( S ) -Xl(s)f < c(t)E(X M (0) -X v (0)) 2 , (36) 

s<t 

giving another proof of (|32|) . Moreover, applying Doob's maximal inequality (with p = 2) 
to the vector-valued martingale of the form 

M T (t) = x;(t)-x;(t) + [ o(i)\x;( s )-x^ s )\ds 

Jo 

constructed from f(x, y) = x — y and using (|36|) yields 

Esup|M r ( S )| 2 < c(t)E(X M (0) -X u (0)) 2 , 

s<t 
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which in turn implies 

Esu P (x;( s ) -x;( s )) 2 < c (t)E(x M (o) -x,(o)) 2 . 

s<t 

This allows to improve (|32|) to the estimate of the distance on paths: 

W^X^X^f < c(T)Wi(^ V ). (37) 

Similarly one can strengthen the estimates for subdivisions leading to the convergence of 
the distributions on paths (1221 . 

Using the Skorohod theorem for the weak converging sequence of random trajectories 
Xjf (let us stress again that the convergence with respect to the distance ([H]) implies the 
weak convergence of the distributions in the sense of the Skorohod topology), one can put 
them all on a single probability space forcing the processes XJf to converge to X^ almost 
surely in the sense of the Skorohod topology. 

Passing to the limit r = rfc — ► in fl35l) . using the continuity and boundedness of / 
and Lf and the dominated convergence theorem allows to conclude that the martingales 
M T (t) converge almost surely and in L 1 to the martingale 

M(t) = f(X,(t)) - f(X) - [\L c{x ^ s)) f)(X,(s))ds, 

Jo 

in other words that the process X^(t) solves the corresponding (nonlinear) martingale 
problem. 

Step 4 [completion] 

To prove (J3J) one writes using the martingale properties of M(t): 
j t {f^ t ) = ^m-E(f,X ll (t + s)-X ll (t)) = hm-E j^' {L c{Xll{a)) f){X^a))ds 

= (L.J,^)) + lim-E [ t+S [(L c{x ^ s)) f)(X,(s)) - L m f(X„(t)] ds, 
implying (j3J) by the continuity of \x t . 

3 Regularity 

Discussing regularity we reduce our attention for simplicity to Feller processes. It is 
known (see e.g. [13]) that from the sufficient regularity of nonhomogeneous versions of 
these Feller processes one can naturally deduce the uniqueness and regularity for the 
corresponding nonlinear problems. 

By C\ iv (respectively C^) we shall denote the subspace of functions from C k (R d ) with 
a Lipschitz continuous derivative of order k (respectively with all derivatives up to order 
k vanishing at infinity). 

We shall discuss in detail only the first derivative. 
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Theorem 3.1 Assume the conditions of Propositions A. 4 and \A.5\ hold. Then the spaces 



C\ ip and C\ ip fl are invariant under the semigroup T t constructed above from the 
generator 

Lf(x) = ±{G{x)V,V)f(x)+{b(x),Vf{x)) + J (f(x+y)-f(x)-{Vf{x),y))v{x,dy), (38) 
and for any f e C\ ip , G L x n C ao (R d ) 

j t {T t f,<f>) = {LT t f,<f>), t>0. (39) 
Proof. First let us calculate V jEg(X^(kr)) for an arbitrary k and g G C\ ip (Tl d ). One has 



Eg(Xl(kr)) = g(x + z x + ... + z k )P T x {dz x )...P T k ^ (dz k ). 

As 

V^g{Xl{kr)) = jjmi(E ff (^(fcr)) - Eg{Xl{kr))) 
does not depend on coupling, one can write 

VjEg(Xl(kT)) — lim — / (g(x + hej + w x + ... + w k ) - g(x + v x + ... + v k )) 

h— >0 fl J 

Px+he x(dw 1 dvi)...P T ^, ,^k-i ,v-fe-i (dw k dv k ) 
= hm / I (Vc?(a; + i>i + ... + v k ), ej H ... + 

h^O J ll 



h 



+0(l)-{he j + W1-V1 + ... +w k - v k ) 2 ] 



h 

Pl +he3 , x (dw 1 dv l )...Pl +hej+i:t - 2i „ +E;rii Vm (dw k dv k ). 
The term with 0(1) vanishes as it can be rewritten by Proposition IA.1I as 

Una 0(1) i J (hej + w x -v x + ■■■ + w k „i - f fc -i) 2 (l + cr) 

and consequently, iterating this procedure as 

1 



lim fO(l)/i 2 (l +cr) k = 0. 

h->0 h 



Hence 



VjEg(X^(kT)) = lim \ [ {Vg(x + v t + ... + v k ), hej +w x -v x + ... + w k - v k ) 

h^O fl J 

P: +he . tX {dw x dv x )...Pl +hej+ ^ WmiX+ ^ Vm (dw k dv k ). 
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Assume now first that g is from the Schwartz space S^R^) so that Proposition I A. 51 applies 
and one can write 

/ (V£f(x + vx + ... +v k ), hej + wi - vi + ... + w k - v k )P T fc _! (dw k dv k ) 

^ V jfc #(x + Vl + ... + V k )(h ej +W X -V X + ... + W k ^ - Ufc-l)'*- 1 ^.! + zf) 

ifc,ifc-i=i 

QLfc-Ws+y*- 1 vA dzkdvk ) + 0(r)(tw fc _i - Wfc-i) 2 . 

J V 1 Z^m — 1 m '' 

Consequently, as the last term does not contribute to the limit h — > 0, and iterating this 
procedure one obtains 



GLi^) (^l^i )<%n K^+^i) (^2^2) • "^h-i v ( p +j^ 1 Vm ) ( dz k dv k), (40) 

which is the rigorous explicit form of the (a priori not clearly defined but intuitively 
appealing) expression 

EVg(x + F r °(x) + Y t \X(t)) + ... + Y T k -\X((k - l)r)) 

x + dY T k -\X((k-l)T)) \ ^ / + d_Yl{x) 



dX((k-l)r) J"\ dx 

Approximating arbitrary g by functions from the Schwartz space one can conclude that 
gOD holds for all g G Cl ip (R d ). 

We want to show now that these derivatives are Lipschitz continuous. To shorten the 
formulas let us do it for the case of d — 1 only. In this case 

VE^(X;(A;r)) = J Vg{x + v, + ... + v k )(l + z x )...(l + z k ) 

Q T Du{x){ d Zidv 1 )Q T Dv(x+Vl) (dz 2 dv 2 )...Q T D , Jdz k dv k ), (41) 

and by Proposition IA.4I one can write 

WgiXl^kr)) -Wg{Xl 2 {kr)) 
= J [Vg{x 1 + Vl + ... + v k )(l + zi)...(l + z k ) - Vg(x 2 +v x + ... + v k )(l + z x )...(l + ~z k )] 

QDu(x 1 ,x 2 )( dz l d ^ d ^l d Vl)QD^ xl+Vl , X2+ v 1 )( dz 2d^^ 

Writing 

Vg(xt +vi + ... + v k ){l + z x )...(l + z k ) - Vg(x 2 + vi + ... + v k ))(l + ^...(l + z k ) 
= (V#(xi + vi + ... + v k ) - Vg(x 2 + v x + ... + v k ))(l + Zl )...(l + z k ) 

+Vg(x 2 + V! + ... + V k )[(l + Zl )...(l + Z k ) - (1 + Z!)...(l + ~z k )l 
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and applying the Holder inequality to estimate the integral over each of these two terms 
yields the estimate 

\VEg(X^(kr)) - VEg(X T X2 (kr))\ < K[I 2 (k, Xl ,x 2 ) + I 2 (k,x h x 2 )} 

with k depending on the norm and the Lipschitz constant of Vg, where 

Io(k,x 1 ,x 2 ) = j[xx - x 2 + vt - vt + ... + v k - v k ] 2 \(l + zi) 2 ...(l + z k ) 2 

Q T Du( Xl , X2 )(dz 1 dS 1 dv 1 dv 1 )...Ql u{xi+ ^ 

and 

I 2 (k,x u x 2 ) = J [(l + Zl )...(l + z k )- (l + z l )...(l+~z k )} 2 
Q T Dv(x 1 ,x 2 )( dz i d hdv 1 dv 1 )...Q] Juixi+ ^^^ 

By (El 

I 2 (k, x h x 2 ) < (1 + cr)I 2 (k - 1, Xl , x 2 ) < ... < (1 + cr) fc (xi - x 2 f . 

It remains to estimate I 2 . It would be convenient here to introduce special notations for 
the products: 

Z k = (1 + z x )...(l + z k ), Z k = {\ + z x )...{\ + z k ). 

Now one can write 

Z k — Z k = Z k _i(l + z k ) — Z k _i{\ + z k ) = (1 + z k ){Z k _i — Z k _i) + (z k — z k )Z k _i, 
so that 

[Z k - Z k f = (1 + z k ) 2 (Z k ^ 1 - Z k -\f + (z k - z k ) 2 Z 2 k _ x + 2(1 + z k ) (z k - z k ) (Z fe _i - Z fe _i)Z fe _i. 
Plugging this into the expression for If yields 

I 2 (k, Xi, x 2 ) < (1 + cr)I 2 (k - 1, xi, x 2 ) + crl^(k - 1, x x , x 2 ) + cr j fi(Z fc _i - Z k _-^)Z k _ x 

where Q in the last integral is a function of Xi, x 2 , Vj, Vj such that 

\Q\ < c\\xi - x 2 + v 1 - v 1 + ... + - t>jfe-i||. 
Hence, applying to this last integral again the Holder inequality yields 

ll{k, xi, x 2 ) < (1 + cr)I 2 (k - 1, xi, x 2 ) + crl^k - 1, x 1: x 2 ), 
which taking into account the above bound for Iq rewrites as 

I 2 (k, Xi, x 2 ) < (1 + cr)l\{k - 1, xi, x 2 ) + ct( Xi - x 2 ) 2 
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with yet another c as long as t = rk remains bounded. Using this formula recursively 
implies 



I\{k, xi, x 2 ) < cr(xx - x 2 f{l + (1 + cr) + ... + (1 + erf) < c(kr)(xi - x 2 f . 

Consequently one obtains the uniform estimate 

|VE0(J£(*r)) - VE^(X; 2 (A;r))| < Kc{kr)\\ Xl - x 2 \\. 

Hence from the sequence of the uniformly Lipschitz continuous functions VE/(X^(s)), 
k — 1, 2, one can choose a convergent subsequence the limit being clearly VEf(X x (t)), 
showing that Ef(X x (t)) G C\ ip . The uniform convergence implies Ef(X x (t)) G C^nC^, 
whenever the same holds for /. 

To complete the proof of the theorem it remains equation (|39|) . But this is easy: for 
t = it follows by approximating / with /„ G C 2 (R d ) and then for arbitrary t it follows 
by the invariance of the class C\ ip under T t . 

Second derivative can be analyzed similarly, but the assumptions and calculations 
become essentially longer. 

A Coupling of Levy processes 

We describe here the natural coupling of Levy processes leading in particular to the 
analysis of their weak derivatives with respect to a parameter. Recall that by C^ ip we 
denote the subspace of functions from C fc (R d ) with a Lipschitz continuous derivative of 
order k. 

Proposition A.l Let Y l s , i = 1,2, be two Levy processes in R d specified by their gener- 
ators 

Lif{x) = i(G t V, V)f(x) + (fc, V/(x)) + J (f(x +y)- f{x) - (V/(x), y)Mdy) (42) 

with Vi G M 2 (Tl d ). Let v G M. 2 {Y(? ,d ) be a coupling of V\,v 2 , i.e. (jSJ) holds for all 4>ii4>2 
satisfying 0j(.)/|.| 2 G C(Tl d ). Then the operator 



Lf( Xl ,x 2 ) 



i(GiVi, Vi) + i(G 2 V 2 , V 2 ) + (y^V^Vi, V 2 ) 



f(xi,x 2 ) 



+(&i, Vi/(xi, x 2 )) + (b 2 , V 2 f{xi,x 2 )) 
+ / [f{ x i + Vi, x 2 + Vz) - f( x i, x 2 ) - ((y u Vi) + (y 2 , V 2 ))f(x 1 , x 2 )]u(dy 1 dy 2 ) (43) 



(where Vj means the gradient with respect to Xi) specifies a Levy process Y s in R 2d with 
the characteristic exponent 



-i 2 



Vx 1 ,x 2 (Pl,P2) = y/ G ( X l)Pl + V G ( X l)P2 + iH X l)Pl + ih i X 2)P2 



- / (e*^ 1 ^ - 1 - + y 2 p 2 ))v{d yi dy 2 ), 
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that is a coupling ofYJ-, Y s in the sense that the components ofY s have the distribution of 
Y} and Y 2 respectively. Moreover, if f( X \,x 2 ) = h(xi — x 2 ) with a function h G C 2 (H d ), 
then 

Lf( Xl ,x 2 ) = ±(( y /G~ 1 - v^) 2 V, V)h{x x - x 2 ) + (&! - b 2 , Vh)( Xl - x 2 ) 



+ J [h(x 1 -x 2 +y x - y 2 ) - h(x 1 - x 2 ) - (yt - y 2 , Vhfa - x 2 )]v(dy 1 dy 2 ). (44) 
In particular, if f(xi,x 2 ) = (xi — x 2 ) 2 , then 

Lf( Xl , x 2 ) = tr( - ^G~ 2 ) 2 + 2(bi - b 2 , Xl - x 2 ) + J ( yi - y 2 ) V(d yi dy 2 ). (45) 

Finally 

E(Z + Y t 1 -Y t 2 ) 2 = (Z + t(b 1 -b 2 )) 2 + t(Tr( v fG~ 1 - v fG~ 2 ) 2 + [ I \y x - y 2 ) 2 u(d yi dy 2 ) 



(46) 

Proof. Straightforward. In fact, clearly Y s couples Y},Y 2 , because say r} XliX2 (pi, 0) is 
the characteristic exponent of Y}. Equation ( T441 follows from ( )43i) . The second moment 
(|46|) is found either by twice differentiating the characteristic function, or by the Dynkin 
formula in conjunction with (jUJ). 
Similarly one obtains 

Proposition A. 2 Let Y l s , i = 1,2, be two Levy processes in R d specified by their gener- 
ators 

LJ(x) = (b t , V/(x)) + [ (f(x + y)- f(x))vi(dy) (47) 



with Vi G M.i(H d ). Let v G .Mi(R 2d ) be a coupling of v\,v 2 , i.e. flSJ) holds for all 0i,0 2 
satisfying ^(.)/|.| G C(R d ). Then the operator 

Lf(xi,x 2 ) = (6i, Vi/(xi,x 2 )) + (6 2 , V 2 /(xi,x 2 ))+ / [f(x 1 +y 1 ,X2+y 2 )-f(x 1 ,x 2 )]u(dy 1 dy 2 ) 

(48) 

specifies a Levy process Y s in R M that is a coupling ofY}, Y 2 such that for all t 

BU + Y^-Y 2 ]] < \\£\\ + t(\\b 1 -b 2 \\ + j j Wyt-y^Wyxdy^. (49) 

Proof. One approximates \y\ by a smooth function, applies Dynkin's formula and then 
passes to the limit. 

Next, let Y t (z) be a family of Levy processes in R^ parametrized by points z G R d 
and specified by their generators 

L[z]f(x) = ~(G(z)V, V)f(x) + (b(z), Vf(x)) + J (f(x + y) - f(x) - (Vf(x),y))v(z; dy) 

(50) 

where v{z\ .) G .M 2 (R d ). We are interested in defining the process ^Y t (z). 

17 



We shall describe this process via a certain derivative type operator on Levy measures 
connected with a coupling. Namely, let v Xl , X2 {dyidy 2 ) be a family of A^-couplings of 
v(x\\ .), v(x 2 ; , ■) (in the sense that v Xl)X2 G M.2(R 2d ) and (J6]) holds for all 4>i,4> 2 such 
that 0j(.)/|.| 2 G C(R d )). For instance, these could be optimal couplings with respect to 
the cost function (yi — y 2 ) 2 , i.e. those couplings, where the infinum in the definition of 
W 2 {v{x\, .), v(x 2 , •)) IS attained. 

Let T h (yi,y 2 ) = ((yi — 2/2)/^? 2/2) an d the measure fj-i,> on R 2d be defined as the 
push forward of u x+h4jX = v T -i^ by T h , i.e. 

t f f Vi — Vi 

f(z,y)v T h _ 1{ ^ x) (dzdy)= / / /( — - — ,y 2 )u x+h ^ x (dyidy 2 ). 

Clearly uJL lr , . is a Levy measure with a finite second moment whenever this is the case 

for i^+h^ x . The relevant smoothness of v will be defined now as the existence of the weak 
limit 




i.e. 



lim y J g{yi,y2)^i^ x) {dyidy 2 ) = J J g{y u y 2 )D i v x {dy 1 dy 2 ), ^f^i G C(K 2d ^ 



To see the rational behind this definition observe that if u(x, .) = v Fx with a given v and 
a family of transformations F x (.), then 

= ^,VF*(.)),F X (.) 

is the push forward of z/ with respect to y 1— > (£, VF,^?/)), F x {y) (V is the derivative with 
respect to x). On the other hand, if u Z)X has a density, i.e. v z ^ x {dy\dy 2 ) = v z ^ x {y\, y2)dy\dy 2 . 
then .D^z/e has the density 

lim h d u x+h4x (y + hz,y). 

If the coupling is optimal (is given by minimizers in the definition of the W 2 - distance) 
this derivative is connected with the derivative of W 2 via the formula 

// d x 2 

\z\ 2 D^u x (dzdy) = ( — \ h=0 W 2 (v{x + h^; .), u(x; .)) 

We shall need further only the partial derivatives D % v x = D ei v x in the directions of the 
co-ordinate vectors e^. The reason for introducing these derivatives lies in the observation 
that its action on Levy measures corresponds to the derivation of Levy processes. More 
precisely, the following holds. 

Proposition A. 3 Let Y t (z) be the family of the Levy processes in H d , z G H d , specified 
by their generators (1501) . Suppose G(x),b(x) G C 1 (R d ) and v{x, .) is smooth in the above 
sense (i.e. D^v are well defined with respect to a certain coupling), (i) Then the coupled 
random variables in R 2d 

(h-'iYtix + he,) -Y t (x)),Y t (x)) 
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in R 2d has a weak limit that we denote (¥ jY t {x) ,Y t {x)) and that has the distribution 
Q l Div{x) °f the Levy process at time t with the characteristic exponent 



V^G{x)q+ y/G(x)p\ +i(Vjb(x),q)+i{b{x),p) 
J ^ e i q z+ip y _ 1 _ ipy _ i qz ))D j u x (dzdy). 



(51) 



(ii) Moreover, if g G (R ) ; then the partial derivatives VjEg(x,Y t (x)) exist and 

>dg, 



V j Eg(x,Y t (x)) = J \^ J g(x,y) + (^-(x,y),z)jQ t DJu(x) (dzdy) (52) 
(V j means the derivative with respect to the variable x). 

Proof, (i) The characteristic exponent of the Levy process Th(Y t (T^ (e-j, x)) is 

Q 



vi' h ((l,P) 



G{x + h ej )l + ^/G{x)(p-^) 



1 Q 
+ i(b(x + hej), -) + i(b(x), (p - -) 



/ (eW/Wwd-ff/*) _ i _ iiyi _ y 2 )l _ ip y 2 )y T . 1{ej x) (d yi dy 2 



which clearly converges to (15"T1) . 
(ii) One has 



^ [Eg(x + hej, Y t (x + hej)) - Eg(x, Y t (x))} = -E [g(x + he j} Y t (x + hej)) - g{x, Y t (x))} , 

where the last expectation can be taken with respect to any coupling of Y t (x + hej) and 
YAx). Hence it can be written as 




V7 ( \ . f 9 9( n 2/1 -2/2- 
Vjg{x, y 2 ) + 2/2), — ^ — , 



0(l)-(h 2 e + (2/i - 2/ 2 ) 2 ) pU^AdyM- 



By the property of the coupling (Proposition IA.lT) the term with 0(1) tends to zero as 
h — > 0. Consequently 



— \ h=0 Eg(x + hej, Y t (x + hej)) = / Vjg(x, y)P t x (dy) 
>dg, 



+ lim 




dy 



v^2/2j 



2/i ~ 2/2 
h 



) pt x+he 1 A d y^ 



implying (j32J) due to statement (i). 

It is worth noting that statement (ii) implies that the distributions of the derivatives 
actually do not depend on coupling. 

So far we have got only partial derivatives. We are now interested in their continuity 
which clearly is linked to the continuity of the measures D l u x . It turns out that the 
relevant notion of continuity is a bit finer than the ^-continuity used above. Next two 
statements reveal two 'crucial bits' of this continuity. 
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Proposition A. 4 Under the assumptions of Proposition \A.3\ assume additionally that 
G(x),b(x) E C\ ip {R, d ) and that the Levy measures D^v x are Lipschitz continuous in the 
following sense: for any Xi,x 2 E R d and j = 1, d there exists a Levy coupling D^(xi,x 2 ) 
of the Levy measures D^u xi , D^u X2 such that 

/ [(l/i - 2/2) 2 (l + z\ + z\) + (z\ - z 2 ) 2 ] D J (x 1 ,x 2 )(dz 1 dz 2 dy 1 dy 2 ) < - x 2 f (53) 

with a constant k. Let Q t D j^ Xl X2 \ denote the distribution at time t of the Levy process that 
couples (VjY t (xi),Y t (xi)) and (VjY t (x 2 ),Y t (x 2 )) according to Proposition \A.ll i.e. the 
Levy process in R 4d specified by the characteristic exponent 

Vl 1 , X2 ( ( li, ( l2,Pi,P2) = ~- V jV / G'(xi)gi + V jV / G{x 2 )q 2 + y f G{xi)p l + \/ G(x 2 )p 2 
+i(V j b(xi)q 1 + Vjb(x 2 )q 2 + b{x 1 )p 1 + b(x 2 )p 2 ) 
+ j (gto+ta-^B+us,, _ ! _ i{ y ipi + y2p2 + ZlQl + Z2 q 2 )Di uX2 (dz 1 dz 2 dy 1 dy 2 ). 

Then for any £ E H d 

/ [(£ + yi - 2/ 2 ) 2 (l + ^i) 2 + (*i - ^ 2 ) 2 ] X2 )(dzidz 2 d yi dy 2 ) < £ 2 +ct{£ 2 +(x l -x 2 ) 2 ) 

(54) 

TOt/j a constant c uniformly for finite times, and for any g E Cj jip (R 2d ) the function 
Eg(x,Y t (x)) belongs to C l Li „(R. ) (also uniformly for finite times). 

Proof. The moment estimates (1541) are obtained directly from the derivatives of the 
characteristic function as in Proposition IA.1I For the second statement we write 

iVjEgixuYtixi)) - VjE0(zi,y t (a;i))| < / Qd Hxi x Adz x dz 2 dy x dy 2 ) 

Jn 4d 

/ \ __ / N ,dg, s s f dg, N 
x l Vjg{xi,yi) -\/ j g{x 2 ,y2) + (g-(zi,2/i),zi) + {-^{x 2 ,y 2 ), z 2 )\ 

(the derivative Vj with respect to x), which does not exceed 

/ ((\xi - x 2 | + - 2/1 1 (1 + | + 1) + \zi - zzDQtffa X2) (dzidz 2 dyidy 2 ), 

and which in turn does not exceed \ft\x\ — x 2 \ due to fl54|) and the Holder inequality. 

In case u(x, .) = v Fx for a family of transformations F x (.) the coupling D^(xi, x 2 ) can 
be obtained as 

/C*i, l/i, 2/2)-D J (a?i, x 2 ) {dz x dz 2 dy x dy 2 ) 



f(V j F(x 1 ,y), VjF(x 2 , y),F(xi, y), F(x 2 , y))u{dy), 

and the condition (|53|) is fulfilled whenever the derivatives Jj^F(x,y) are bounded and 
Lipschitz continuous. 

By Du x we shall denote the vector {D^v x } and by the vector {Q^^m}, j = 

1 <l- 
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Proposition A. 5 Under the assumptions of Proposition \A.3\ assume additionally that 
G(x),b(x) G C^ ip (H d ) and that the function 

J y — V2, e^e*™ - l)u x>z (d yi dy 2 ) 
is differentiable in x around x = z with uniform estimates, more precisely that 

J j(yi- V2, e 3 ){e^ - l)v x , z (d yi dy 2 ) 

d , 



x=z 




(2/i - 2/2, e 3 W mV ~ l)vx,z(dyidy2), x-z)+0(l + \p\)(x - z) 2 . (55) 



dx 

Then for a continuous function g represented via the inverse Fourier transform as 

9{y) = J e^g(p) dp, (1 + \p\)g(p) e (L 1 (R d )) d , 
one has the estimate 

E(Y t (x) - Y t (z),g(Y t (z))) = I (y x - y 2 , g(y 2 ))P: z (dy l dy 2 ) 




>, g(y))(Q t Dv{z) (dwdy), x-z) + 0(t)(x - zf 1(1 + \p\)\g(p)\ dp. (56) 
Proof. Comparing the r.h.s of ( 1551) with the definition of Dv x yields 

(y 1 -y 2 ,e j )(e i ^-l)u x>z (dy 1 dy 2 ) 





(w, ej )(e iyp - l)Dv x (dwdy),x - z ) + 0(1 + \p\)(x - zf. (57) 



Now one has 

' r (yi-y2,e j )e i ^Pl z (dy 1 dy 2 ) 

d d 
-i— | 9=0 Eexp{i(Y t (x) - Y t (z))q + iY t (z)p} = \ q =o exp{tr] x>z (q,p - q)} 



- \/G(I))p) j + (6(x) - b(z)y + / (2/1 - 2/ 2 ) J (e tpy2 - 1K*(^2/1« 



< Q(V(G(^' + W(z) + j j w 3 (e ipy - l)(Dv x (dwdy),x- z\ Ee iYt{z)p +0(t)(l+\p\)( 



x—z) 



2 



Consequently, 

(2/i - 2/2, g(y 2 ))Pt z (dy 1 dy 2 ) 



t J (^-V(G(z)p,g(p)) + V(b(z),g(p)) + J j ' (w, g(p))(e ipy - l)D Ux (dwdy), x - Ee^'dp 
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+0(t) J (l + \p\)g(p)dp(x-z) 2 . 



Similarly 




d 



1 9=0 



e^ V {t[-]^{V^Gl^q+^Gjz)p\ 2 +i{Vb{z)q+b{z) + J J (e iqw+ipy -l-ipy-iqw)Dv x (dwdy)} 



t(-V(G(z) P y +VV(z) + 




{e ipy - l)Dv x (dwdy) ) Ee iYt(z)p 



implying ( 1561) . 

To differentiate the Levy process for the second time, one needs of course the 'second 
derivative' of the Levy measure defined similarly to the first one. Namely, one needs the 
existence of the limit 



lim / /( — , Z2 , Vl V2 , z 2 ,y 2 )D J (x+he k , x)(dz 2 dz 1 dy 2 dy 1 ] 
h-*o Inn 



f(w, z k , Zj, y)D k J (dwdzjdz k dy) 
(58) 



whenever f(w, z k , Zj, y) / (u> 2 + z 1 - + z\ + y 2 ) G C(R 4d ) with D%> (dwdzjdz k dy) belonging to 
A4 2 (Ti 4:d ). The following is a straightforward analog of Proposition IA. 31 



Proposition A. 6 Under the assumptions of Proposition DOl assume that G(x),b(x) G 
C 2 (R d ) and the measures D k J G M. 2 {Yi M ) are well defined by fl58|) . (i) Then for any j, k 
the process 

(VfcV^Or), V k Y t (x), VjY t (x), Y t (x)) 

is defined weakly in R 4d and has the distribution Q l D jk v ^ of the Levy process at time t 
with the characteristic exponent 



V^ir, qk,Qj,p) 



"2 [VfcVjV^Mg + V k y^G(x)q k + V 3 ^G{x)q 3 + ^G{x)p 

+i[{V k V j b{x),q) + {V k b{x),q k ) + (V^x)^-) + (K x )p)) 
+ / \ e irw+i qk z k +i qj z j+ ipy _ 1 _ ^ rw + QkZk + q , z , + p y)]D k J{dwdz k d Zj dy). (59) 

(ii) Moreover, if g G C 2 ip (Tl d ), the partial derivatives V k V jF*g(x + Y t (x)) exist and 
W k WjEg(x + Y t {x)) = J Qpkj^^dwdzkdzjdy) 



V m V^(x + y)(8? + z^){8] + z\) + Vg(x + y)w l 



'm=l 



1=1 



(60) 



(V means the derivative with respect to the variable x). 



Acknowledgments. The author is grateful to the referee whose constructive critique 
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